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Abstract 



Recently Hastings |16j proved the existence of random unitary channels which violate 
the additivity conjecture. In this paper we use Hastings' method to derive new bounds for 
the entanglement of random subspaces of bipartite systems. As an application we use these 
bounds to prove the existence of non-unital channels which violate additivity of minimal 
q-i output entropy. 
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1 Introduction 

In his 2008 paper [16] M. Hastings proved the existence of channels which exhibit non-additivity of 
minimal output entropy. This result settled a long-standing open problem in quantum information 
theory Hastings' paper is interesting from many points of view, not least because it introduced 
some essentially new ideas into the field of random channels. To review the history a little, in earlier 
work Hayden, Leung and Winter [17] had derived bounds for the entanglement of random subspaces 
of bipartite spaces (these bounds are recalled below in Theorem [1]). They used concentration of 
measure arguments to analyze the entropy of random states in high- dimensional spaces, together 
with the "e-net" method to control the entropy of all states in a subspace. Their analysis led to the 
proofs by Hayden and Winter [18j of the existence of channels violating additivity of Renyi entropy 
for all p > 1. Further progress in this direction appeared in the recent work of Collins and Nechita 
[8j [9] on Renyi entropies of entangled states and subspaces. However the p = 1 case remained open 
until Hastings provided the new ingredients to complete this program. 

Our goal in this paper is to apply these new methods from the paper [16] to the analysis of 
random subspaces of bipartite spaces. As an application we derive results about the entanglement 
of a generic high- dimensional subspace, and show that in some regimes this provides strictly tighter 
bounds than the Hayden, Leung and Winter estimates. We also use these bounds to deduce the 
existence of non-unital channels which violate additivity of minimal output entropy, and in fact 
show that such violation is generic for high-dimensional channels. In the process of deriving these 
results we formulate an abstract version of Hastings' method, and we believe that this formulation 
will be useful for the study of other generic properties of random subspaces. 

The idea of using Hastings' method to study entanglement of random subspaces also appeared 
recently in the work of Brandao and Horodecki [5]. Their work is particularly interesting because 
it uses a combination of standard concentration of measure arguments together with some of the 
new ideas of Hastings. There is some overlap between their paper and ours, and in particular we 
re-derive their entanglement bound as a special case of our Theorem [2J However we also extend 
their results in several ways, both by considering different dimensions for input and output spaces, 
and by presenting explicit bounds for the size of the additivity violations. 

The paper is organized as follows. In the rest of this Introduction we recall the entanglement 
bounds derived by Hayden, Leung and Winter, and state the new bounds derived using Hastings' 
method. We then use these bounds to prove the existence of a new class of channels with non- 
additive minimal output entropy. Section 2 contains the main result of this paper, which is a 
general formulation of the Hastings bound. With an eye to possible future applications we state 
this as broadly as possible, namely as a condition which guarantees convergence to zero of the 
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probabilities of a sequence of events in the output space. In Sections 3 and 4 we use the Hastings 
bound to derive the entanglement results in Section 1. In Section 5 we prove the Hastings bound, 
using methods similar to those in the paper [13] . Finally the Appendix contains some technical 
estimates needed for the derivations of the bounds. 

1.1 Entanglement of subspaces 

Consider a subspace C of the bipartite system A ® B. The entanglement of C C A <g> B is defined 
to be 

E{G)=iris(T± B \4>)(4$) (i-i) 

where the infimum runs over normalized states |0) in C, S(-) is the von Neumann entropy, and 
Ttb\<P)(<P\ is the reduced density matrix of the orthogonal projector onto \<f>). Note that E(C) > 
with equality if and only if C contains a product state. 

In the search for counterexamples to additivity, one is interested in finding subspaces with 
large entanglement. Thus the quantity sup E(C) is of interest, where the supremum runs over 
all subspaces of a fixed dimension. This supremum depends only on the dimensions of A, B, C. 
Let d = Dim A, n = Dim B and s = DimC, then the maximum entanglement of a s-dimensional 
subspace in A ® B is 

E max (s,d,n) = sup{E(C) : C C A® B, DimC = s} (1.2) 

Hayden, Leung and Winter [17] obtained the following lower bounds for E max . 

Theorem 1 (Hayden, Leung and Winter) Assume that 3 < d < n. Then 

d fs + l\ 2/5 

E max (s,d,n) > logd - c x c 2 — — logd (1.3) 

n \ dn J 

where c\ ~ 1.44 and c<i ~ 19.84. 

The main results we present in this paper are new lower bounds for E max (s, d, n). The bounds 
are valid for sufficiently large dimensions n and s, and for any dimension d > 2. Theorem [2] concerns 
the case where s scales linearly with n, and Theorem [3] covers the case where s/n->0asn-> oo. 

In order to state our first result we need to introduce the solution of the following optimization 
problem: for x, y > define 



z log z + {d- z) log feu -logz-(d-l) log ( 
h d (x,y) = inf inf <J - ■ ^ V ' : -— >- = y } (1.4) 



0<7<1 z>l [ x (7 + (1 — 7) log(l — 7)) — log(l — 7) 

Theorem 2 Let d > 2, < r\ < r^, and h > hd{ri,r2)- There is uq < 00 such that for n > hq, 
and all s satisfying 77 < s/n < r-i, 

E mgx (s,d,n) >logd-/i^J (1.5) 
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The above result is a generic property, meaning that with probability approaching one as 
s, n — > oo, the right side of (11.51) is a lower bound for the entanglement E(C) of a randomly 
selected subspace C. It is possible to analyze the function hd in detail but for our purposes here it 
is sufficient to note that it satisfies an upper bound which is uniform in d. As mentioned before, 
using related methods Brandao and Horodecki [5] proved Theorem [2] in the case r\ — T2 — 1- 

Our second result concerns the case where s/n — > 0. Define 

h = inf -log(l -7) ~ 3 . 351 n 6 ) 

0<7<1 7 + (1 - 7) log(l - 7) 

Theorem 3 Let d>2, and h > h . Consider sequences {s k ,n k } such that 

s k n k logg fc 

>• U, > as k — > 00 (1.7) 

n k sl /2 

There is ko < 00 such that for k > ko, 

E max (s k ,d,n k )>logd-h(—^) (1.8) 

\n k d/ 



Again we note that the lower bound in (I1.8P is generic for random subspaces in high dimensions. 
The bounds (11.51) . (11.81) and (11.31) can be compared for small values of the ratio s/nd. It can be 
seen that the right side of (ll.5p . (I1.8P behaves like logo? — c(s/nd), while the right side of (11.31) 
behaves like \ogd — c'(s/nd) 2 ^ 5 logo! for some constants c, c'. Thus (11.51) . (11.81) provide a sharper 
bound in the regime s << nd. 

Corollary 4 Let d > 2, h > h and < e < 1/2. Then, there exists s such that 

E max (s, d, sd) >\ogd-h ( ^a-^ ) ( L9 ) 

for all s > s . 

To see this, let n = \s 3 ^ 2 ~ e ~\ and then Theorem [3] implies that there exists Sq such that 

E^s, d, \s^]) >\o g d-h (p^) > bgd- h (^) (1.10) 

for s > So- Without loss of generality, we can assume that d < s 1 J 2 ~ t , which implies sd < |~s 3//2-e ] for 
s > Sq. As described in the next section, there is a correspondence between subspaces of bipartite 
spaces and quantum channels. Thus a subspace which satisfies the bound in (11.101) corresponds to 
some quantum channel where the dimensions of input and output spaces and the number of Kraus 
operators are s, d, \s 3 ^ 2 ~ £ ~\ respectively. However, this channel can be rewritten by using at most 
sd Kraus operators [31], [28]. Therefore this channel corresponds to some s-dimensional subspace, 
say C, of C d <S> C sd , for which E(C) satisfies the same lower bound (ll.lOp . 
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1.2 Violations of additivity 

The subspace C C A <g> B is defined by an embedding W : C s — > C d <g> C n satisfying W*W = I, 
where C is the image of W. This embedding defines two conjugate channels $vk and $^ via 

$ w (p) = Tr cd WpW*, $^(p) = Tr C nWpW* (1.11) 

Letting denote the complex conjugate of the matrix W, the complex conjugate channels $vk 
e> 

and $ w are defined by 

§ w (p) = Tr&WpW*, $w(p) = Tr C nWpW* (1.12) 

It follows that 

£(C) = S min ($w) = S min (®w) (1.13) 

Our next result gives a universal upper bound for the minimum entropy of any product channel of 
the form $ Cg> $, depending only on the dimensions of the spaces (a similar bound was derived in 
[5] for the case s = n). 

Theorem 5 Let p = s/dn, and assume that sd/n > I, then 

S^S®!) < (1-p) log(d 2 -l)-plogp-(l-p)log(l-p) (1.14) 

Theorem [5] will be proved in the Appendix. We will now use Theorems [2] and [5] to demonstrate 
the existence of channels of the form $ ® $ violating additivity For such a product channel the 
violation of additivity is given by 

A5($) = 5 min ($) + S min ($) - S min ($ ® $) = 2S min ($) - S min ($ ® ¥) (1.15) 

Theorem [2] guarantees the existence of subspaces satisfying the bound (11.51) . and hence also the 
existence of channels $ for which 5 , m i n ($) satisfies the same bound. Taking s = n (so r% = T2 = 1), 
this implies the existence of channels for which 

5 min ($)>logrf-^^ (1.16) 

for sufficiently large n. Combining the bounds fll . 16j) and (I1.14p . and estimating log(<i 2 — 1) < 2 logrf 
in f)1.14p . we obtain 

A5($) > p \og(pd 2 ) + (1 - p) log(l -V)-\ h d (h 1) (1-17) 

where p = s/nd = 1/d. Using (I1.17P and the inequality (1 — p) log(l — p) > —p, we get for 
sufficiently large n 



logd-2^(l,l)-l 



;i.i8) 
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For d > exp[2/id(l, 1) + 1] the right side of (11.181) is positive and hence these channels violate 
additivity (recall that hd(l, 1) is upper bounded uniformly in d). Furthermore, the method of proof 
shows that this violation occurs with positive probability for a randomly selected subspace, and 
hence for a randomly selected channel. Since the unital channels have measure zero in the set of all 
channels of fixed dimensions s, n, d, this implies the existence of non-unital channels which violate 
additivity. 

2 The Hastings bound 

This section contains an 'abstract' version of the Hastings bound. Much of the notation was 
introduced previously in [T3j, and we will use several technical results from that paper. 

2.1 Notation 

M n will denote the algebra of complex n x n matrices; the identity matrix will be written /; U(n) 
will denote the group of unitary matrices. The set of states in M n is defined as 

S n = {p G M n : p = p* > 0, Trp = 1} (2.1) 

The set of pure states in M n is identified with the unit vectors in C n and denoted 

V n = {|^>G<C" : (V#> = 1} (2.2) 

We write TZ(s,n,d) for the set of all embeddings W : C s -> C d <g> C n , with W*W = I. There 
is a one-to-one correspondence between such embeddings and pairs of complementary channels 
<£> w : M s -> M n , $^ : M s -> M d defined by 

<f> w (p) = Tr C dWpW* , <S>g(p) = Tr C nWpW* (2.3) 

Thus TZ(s, n, d) is also the set of all such pairs of conjugate channels. The image of the pure input 
states under the action of a channel <3> c : M s — » Md will be denoted 

Im($ c ') = {$ C (|0)(0|) E S d : |0) E V s } (2.4) 

2.2 Random embeddings 

We define a probability measure V s ^ n ^ on the set of embeddings lZ(s, n, d) as follows. Let W be a 
fixed embedding W : C s -> C d ® C n satisfying W * W = J. Then every embedding W G 7?.(s, n, d) 
can be written as 

W = UW , U*U = I (2.5) 

for some unitary matrix U G U(nd). Let Stab(W / o) be the subgroup of unitary matrices which leave 
invariant every vector in the image of the embedding Wq. Then two unitary matrices Ui, U-z define 
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the same embedding if U^U 2 G Stab(W ). Thus K(s,n,d) can be identified with the left cosets 
of the group of unitary matrices with respect to the subgroup Stab(W / )- Let IT be the projection 
from U(nd) onto these cosets. Then the normalized Haar measure Haar on U(nd) descends to a 
normalized measure II* (Haar) on this set of cosets, and this defines our probability measure Vs,n,d 
on 7Z(s, n, d). 

2.3 Definition of the tube 

We recall the notion of the 'tube' at a state p, as defined in [TJ]. First, for any p G Sd and < 7 < 1 
define L 1 (p) to be the following line segment pointing from p toward the maximally mixed state 
I/d: 

L y (p) = jrp + (1 -r)-I : 7 < r < l} (2.6) 

Then the tube at p is defined to be the set of states which lie within a small distance of the set 
L 1 (p). Also for any event C C Sd, the tube at C is the union of the tubes at all states in C. 

Definition 6 Let p G Sd, then the Tube at p is defined as 

Tube(p) = jfl G S d : dist{6, L 7 (p)) < 2 + 13 d ^/^} (2-7) 

where dist(8, L(p)) = inf rg £( p ) \\8 — r||oo- For any output event C C Sd the Tube at C is defined as 

Tube(C) = [J Tube(p) (2.8) 

pec 

2.4 Statement of the bound 

Suppose that for each triplet (s,n,d) there is given an event C(s,n,d) C Sd- We want to find 
conditions which will show that for sufficiently large dimensions s, n there is a nonzero probability 
that for a randomly selected embedding W the event C(s,n,d) will not contain any output states 
of the form $^(|0)(0|). We analyze this by considering the sequence of complementary events, 
namely the events {W : Im($^) fl C(s,n,d) 7^ 0}, and showing that their probabilities approach 
zero as s, n — > 00. 

Theorem 7 (The Hastings Bound) Let {C(s,n,d) C Sd} be a collection of output events de- 
fined for each triplet of dimensions (s,n,d). Fix d > 2, consider sequences {sk,n k } — * oo ; and 
define 

B k = {We TZ(s k , n k , d) : bn($gr) fl C(s k , n k , d) ^ 0} (2.9) 
Suppose there is 7 G (0, 1) such that 

lim ( d 2 \ogn k + n k d\ogd + (n k - d)M{j, k) - s fc log(l - 7) ) = -00 (2.10) 
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where 

M(rf,k) = sup{7Hogp : p G Tube(C(s fc , n k , d))} (2.11) 

Then 

lim V Sk , nk , d (B k ) = (2.12) 

k— >oo 

As a consequence of the Theorem, if the conditions are satisfied then for sufficiently large k 
we have V Sk , nkjd (B k ) < 1, and hence there must exist embeddings W such that ^ 
C(sk, n k , d) for any input state \<p). Theorem [7] will be proved later in Section 5. First we use it to 
deduce Theorems [2] and 



3 Proof of Theorem 2 



In this section we apply the Hastings bound to prove Theorem [2j Fix dimension d and the parameter 
h. In the following, we consider a sequence of integers n large enough so that we can choose s = s(n) 
satisfying r\ < s/n < r 2 for each n. We will prove the existence of an integer n such that (11.51) 
holds for all n > n , where no will not depend on the choice of s(n). Define 

C(s,n,d) = [ P eS d : S(p) <logd-/i(^)J (3.1) 

Let Aj be the eigenvalues of p, then 

1 d 

S(p) - log d = -- ^(Aid) log(A^) (3.2) 



i=l 



Define 



then it follows that 



f(x) = x logx — x + 1 (3.3) 



C(s,n,d) = {peS d : Yj(\id)>hl-) (3.4) 



^ i=i 



Next define 



F(a;) = — log x + x — 1 (3.5) 

and note that for any 9 G S d 

d 

dlogd + Trlog# = -J2 F ( i d ) ( 3 - 6 ) 

i=i 
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where {9{\ are the eigenvalues of 9. Thus recalling the definition (12.111) it follows that 

d 

d\ogd + M(j,n) = -inf {j^F(M) : 6 e Tube(C(s, n, d))} 



i=l 



Now from definitions (12. 7p . (12. 8p it follows that if 9 G Tube(C(s, n, d)) then for some r e 



1 d 

i = ;2 i + e i , ^ = r Aj + (1 - r) -, = 



d' 

i=l 

where the eigenvalues Aj satisfy 



i=i 

and where 



k|<2,/^ + 13d /l0grf 
n 



The Fannes inequality [TT], [3], [TJJ implies that 

d d 



E /(M) - ^ /(*d) < 77 = d e m (log d + log — ) 



i=l i=l 



where 



i=l 

Note that r/ — * as n, s — > oo. 

We now apply Lemma 12 from [TJJ which says that for all x > and all r e [7, 1] 

/(rx + l- r) 
f[) - /(I -7) 
Applying this to (13.91) . (13. lip we deduce that 

^/(M)>Mi-7) (-)-»? 

i=l 

Thus we finally arrive at the inequality (putting Xj = 9 id) 

E : E > w - 7) ( £ ) - 4 

i=l i=l n J 



Define 

d d 

m d {y)= d inf_ {^F( Xi ) : ^/(a*) >y] (3.16) 

then (I3.15P can be written 

d log d + M( 7 , n) < -m d (hf{l- 7) (~) - v) (3.17) 

In Section 5.7 of [H] the following identity was derived: 

d — z , , , , , d — z 



(y) = jnfj -logz- (d- l)log— -y : 2 log z + (d - z) log ^— y = y j (3.18) 



It was also shown in [2] that the function m d is increasing and hence has an inverse rn^ 1 . Given 
y > there is a unique z > 1 satisfying z log 2 + (d — z) log = y, and this function also has an 
inverse. Thus 

d — z 

m~^(w) = z log z + (d — z) log - — - (3.19) 

where z is the unique solution of — log z — (d — 1) log^5f — w - Since both of these functions are 
increasing this can be written as the minimization 

f d — z d — z ~) 

ml 1 {w) = inf < z logz + (d — z) log - — - : — log z — (d — 1) log - — - = w \ (3.20) 
z>i I a— 1 a — 1 J 

Thus recalling (II. 4p we have 

^(x, y) = q inf t ^T^y m d 1 ( - - T) ( 3 - 21 ) 
Let 7 m be the value where the infimum is achieved, then 

m d (x f(l - 7 m ) /i d (x, y)) = - log(l - 7m ) j/ (3.22) 
Returning to (13.171) . and using the bound s/n > n, 

d log d + M( 7 , n) < -m d (h f(l - 7) n - 77) (3.23) 

By assumption ft > h d (ri,r2), and also 77 — > as n — > 00, hence there is 5 > such that for n 
sufficiently large 

h f(l - 7) n - 77 > fr d (n, r 2 ) /(l - 7) n + 5 (3.24) 

and thus 

dlogd + M( 7 ,n,) < -m d (/i d (r 1) r 2 )/(l- 7 )r 1 + 5) (3.25) 
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Furthermore as was shown in 1 1 



, , , d(l — z 1 ) 1 , 
m'Jy) = -A L > _ 3.26 

dyy} d\ogz-y z v ; 

where z is the unique solution of z log z + (d — z) log ^Ef = V- The maximum value of z is d hence 
we obtain 

m'M > \ (3-27) 
Thus from (13.251) (applying the mean value theorem) 

d log d + M( 7 , n) < -m d (h d ( ri , r 2 ) /(l - 7 ) n) - ^5 (3.28) 
Setting 7 = 7 m we have 

m d {h d {r u r 2 ) /(l - 7 m ) ri) = - log(l - 7 m ) r 2 (3.29) 
Thus finally returning to (I2.10p we have 

d 2 \ogn + ndlogd + (n - d)M(^ m ,n) - slog(l - 7 m ) 

= d 2 (logn + log of) log(l - j m ) + (n - d) [dlogd+ M(j m , n) log(l - 7r) 

n \ n 

( Is 

< d 2 (\ogn + logrf) - r 2 dlog(l - j m ) + (n - d) ( log(l - j m ) r 2 - ^ 6 log(l - 7 m ) 

< d 2 (\ogn + \ogd) -r 2 d\og(l-j m ) - (n-d)^5 (3.30) 

where we used s/n < r 2 . Since 5 > the right side of (13.301) diverges to — oo as n — ► oo. Thus 
applying Theorem [7] yields the result. 



4 Proof of Theorem S 



Following the steps of the proof of Theorem [2] leads to 



dlogd + M( 7 , fc) < -m d (h /(l - 7) ( 



77 (4.1) 



where 



V = de m (logd + log-), e m <2dJ l -^ + 13d 2 J 1 ^ (4.2) 
e m V «fc V s fc 



The assumptions that Sk — > oo and log Sk/s k — > imply that 

77 



3/2 

as fc -> 00 (4.3) 



and hence the first term /i /(l — 7) (sfc/n*.) on the right side of ( 14.1 ft dominates 77. Since s^/n^ — > 
we must consider the behavior of md{y) as y — > 0. 
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Lemma 8 There is yo > such that y 1 rrid(y) is decreasing for all < y < y . Furthermore 

lim^M = l (4.4) 
v->0 y 

Lemma [8] will be proved in the Appendix. We now use it to analyze (14.11) . Since s k /n k — > 0, 
and using (14. 3p . it follows from (I4.4p that for any e > there is k$ such that for k > k Q , 

m d (h f(l - 7) (^) - ^ > (1 - e) h f(l - 7 ) (^) - V (4.5) 

Hence from (14.11) 

dlogd + M( 7 , k) < -(1 -e)h f(l - 7) (—) + V (4-6) 

\n k J 

Turning now to (12 . 1 0[) we have for k > 

d 2 log n k + n k d log d + (n k - d)M{^, k) - s k log(l - 7) 



< d 2 (\ogn k + logd) - — log(l - 7) 

n k 



- (n k - d) ({I - e) h f(l - 7) (-) - n + - log(l - 7)) 
V ^ n k J n k J 



d 2 (\ogn k + \ogd) - — log(l - 7) 

n k 



Sk{n k - d) 



n k 

By assumption there is 7 m such that 



((1-6)^/(1-7) -r/(^)+log(l- 7 ) 



From (14.31) it follows that there is 5 > such that for k sufficiently large 



Thus from f ]4.T[) 

d 2 log n k + n k d log d + (n k - d)M(j m , k) - s k log(l - j n 



(4.7) 



/(1-7J ( 



(1 - e) h f(l -j m ) -V — + log(l - 7m) > S (4.9) 

Sk 



< d 2 (logn k + log d) - ^ log(l - lm ) - Sk(nk d)6 (4.10) 

n k n k 



Since s k / logn k — > 00 the right side of (14. lOf) diverges to —00 as k — > 00. QED 
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5 Proof of the Hastings bound 



First we recall some of the ideas and notation from [T3]. The set of eigenvalues of a state p is 
denoted spec(p). Also A^ denotes the simplex of d-dimensional probability distributions: 

d 

A d = {( Xl ,...,x d ) CR d : Xi>0 } ^Xi = l} (5.1) 

i=i 

5.1 Random states 

The pure states V n can be identified with the unit sphere in M 2n . This provides a probability 
measure on V n , namely the normalized uniform measure which we denote a n . Thus saying that 
G V n is a random vector means that \ip) has the uniform distribution a n . 

Let \z) — {z\ ■ ■ ■ Zdn) T be a unit vector in Vdn- Then \z) can be written as a n x d matrix 
M, with entries 

Mij(z) = z (i _ 1)d+j , i = 1, . . .n, j = 1, . . . , d (5.2) 

satisfying Tr M*M = £V \z lj \ 2 = 1. Define the map G : V d „ . -> by 

= M(*)*M(z) (5.3) 

The eigenvalues of G(z) lie in A^. When \z) G Vdn is a random vector, the probability density fid, n 
of these eigenvalues is known explicitly [25], [35]: for any event A C A^ 



1 Ki<j<d i=l ^ i=l ' 



_ w] (5.4) 

where Z(n, d) is a normalization factor. We recall the following bound which was derived in 
Lemma 9 For all d, for n sufficiently large, and for any event A C A^, 

i r d i 

Vd,n(A) < JJZT[)\ exp d 2 \°g n + ( n ~ d )d\°gd+ (n- d) sup^logWi (5.5) 



»=i 



Now let C C 5d be any set which is invariant under conjugation by every unitary matrix in 
U(d). Then the event {\iJj) : Tr 2 |'0)(V ; | G C} depends only on the eigenvalues of Tr 2 |V')('0|, and 
thus its probability is determined by pd, n - For an arbitrary set C C Sd we define 

C = {VpV* : p G C, V G spec(C) = (J spec(p) (5.6) 

pec 
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Then C is invariant under conjugation by an arbitrary unitary matrix, and spec(C) = spec(C). 
Hence from (15.51) we deduce the bound 



G*(a nd )(C) < G*(a nd )(C) 
= / u djn (spec(C)) 

1 



< 



(d-1)! 
1 



exp 



exp 



d 2 log n + (n — d) d log d + (n — d) sup log 

li)Gspec(C) i=l 



d 2 log n+ (n — d) d log d + (n — d) sup Tr log p 



W; 



(5.7) 



5.2 Random embeddings yield random output states 

Let W G lZ(s,n,d) be a random embedding. Then for any pure state |0) G V, the vector |^) = 
W\4>) = UW Q \4>) is a uniform random vector in C nd . Thus <3>^(|0)(0|) = Tr 2 |V ; ) (V'l is the reduced 
density matrix of a random vector. This remains true if |0) is a random input state. To formalize 
this relation we define the map 

H : U(s, n, d)xV s ^ M d , (W, \<f>)) ^ $^(|0) (0|) (5.8) 



Lemma 10 



H*(V s , n4 x a s ) = G*{a dn ) (5.9) 

The proof of Lemma [TD] is very similar to the proof of Lemma 7 in [T3] and so we omit the details 
here. Lemma [TU1 implies that if W is chosen randomly according to the measure V Stn ^ d and \4>) is 
chosen randomly and uniformly in V s , then the eigenvalues of the matrix will have the 

distribution /^ >n , as defined above in (15. 4p . 



5.3 Typical channels 

For a random embedding W 'most' output states of the channel $^ are close to the maximally 
mixed state. More precisely, an embedding W will be called typical if $^ maps at least one half 
of input states into a small ball centered at the maximally mixed output state I/d. The ball is 
defined as follows: 



BJn) 



P^S d 



d 



< 2 



logn 



?? 



(5.10) 



Definition 11 An embedding W is called typical if with probability at least 1/2 a randomly chosen 
input state is mapped by $^ into the set B d (n). The set of typical embeddings is denoted T : 



T 



W 



^(\<f>){<f>\)eB d (n))>l/2 



(5.11) 
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As the next result shows, for large n most embeddings are typical. This result was proved in 
and we just quote the result here (note that in [H] the definition of B d {n) contained a free 
parameter b which was required to be at least - here we have set 6 = 2). 

Lemma 12 For each d > 2 taking n sufficiently large, and for all s, 

„ , m „, 2 c? r , 9 , , 4(n — d) 

V s , n , d {T c ) < — — exp[-« d 2 logn], a = K — ' - 1 (5.12) 

Thus as n — > oo with high probability a randomly chosen embedding will lie in the set T. In 
particular V Stn ,d(T c ) < 1 for n sufficiently large. 

The next result says that for a typical embedding W there is a fixed fraction of input states 
which are mapped by into the tube at any output state p. This result is crucial for the proof 
and differs in some significant ways from the related proof in [T3], thus we include full details in 
the Appendix. 

Lemma 13 Let d, s > 2, then for n sufficiently large, for all W G T and p G Im($^) 

a s (\4>) : $^(|0>(0|) GTube(p)) >-\l-jj (5.13) 

5.4 The proof 

Define 

E k = {{W,\(j>)) :WeB k , $%{\<f>)(<f>\) eTube(C(s k ,n k ,d))} (5.14) 

The proof will proceed by proving upper and lower bounds for the probability of E k , that is 
(V s , n ,d x a s )(E k ). 

For the upper bound, note that by Lemma [TUl 



{V s ,n,d x a s ){E k ) < {V s , n , d x a s ){{W, |0» : $^(|0)(0|) G Tube(C(s k , n k , d))} 

= (V s ,n,d x a s )(i/- 1 (Tube(C( Sfc ,n fc ,rf)))) 

= H*(V s ,n, d x a s )(Tnhe(C(s kl n kl d))) 

= G*(a dn )(Tabe(C(s k ,n k ,d))) (5.15) 



Recall the definition of M(j, k) in (12.111) . Using the bound (15.71) we deduce 

1 

Jd^Ty. 



{V s ,n,d x o- s )(E k ) < 1 exp d' z \ogn k + (n k -d)d\ogd+ (n k -d)M(-f,k) 



= a(d) exp d 2 log n k + n k d \ogd + (n*. — d)M(7, fc) (5.16) 
where a(cf) = exp[— d 2 logrf]/(<i — 1)! . 
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The derivation of the lower bound is very similar to that in [TJ], however we include it here for 
completeness. First we write 

{V s , n4 xa s ){E k ) = E w [l Bk a s {\<j)) : $&(|0>(0|) G Tube{C{s k ,n k ,d)))] 

> E w [l BknT a s (\<f)} : $&(\<f>)(<f>\) eTube(C(s k ,n k ,d)))\ 

where K w denotes expectation over 1Z(s k ,n k ,d) with respect to the measure V s ^d-, and l.s fc nT is 

the characteristic function of the event B k R T. Given that W G B k there is a state \0) G C s such 
that 

^(|0)(0|)eC(s fc ,7i fc ,d) (5.17) 
Since Tube($&(|0)(0|)) C Tube(C(s fe , n fc , d)) it follows that 

(P S , M x a s )(E k ) > E w [l BknT a s (\4>) : $£(|0}<0|) G Tube(<J>^ (|0><0|)))] (5.18) 
Applying Lemma [131 to (I5.18P gives 

(V s , njd x a s )(E k ) > \( 1 ~ , y) M^nr] (5.19) 



1 



s k -l 



4 ( 1 - 7 ) Vs, n4 (B k n T) (5.20) 
> i( 1_ 7) (Ps,nA B k) ~ ?\m(T c )) (5.21) 

Putting together the upper and lower bounds for (V s>n ,d x &s){.E k ) produces the following bound: 
for all d > 2, for all < 7 < 1, and for n sufficiently large 



V s ,n,d\B k ) — V S)n ,d{T c 



I \ s fe — 1 



< 4 (73^) (^WX^)W (5-22) 



j \ Sfc — 1 



< 4a (d) ( exp d \ogn k + n k d\ogd + (n fc — d)M0y, k) 

VI -7/ L 

= 4a (d) (1 — 7) exp [<i 2 logn fc + n fc d logd + (n k — d)M(y, k) — s k log(l — 7) 
Note that Lemma fT2l implies V s ,n,d(T c ) — > as k — > 00. Also, by assumption there is 7 such that 

d 2 \ogn k + n k d\ogd + (rife - d)M(y, k) - s k log(l — 7) — > -00 (5.23) 
as k — > 00. By choosing this value for 7 we deduce that V s ,n,d{B k ) — > as required. QED 
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A Proof of Theorem 



5 



The bound for S'min^ ® *£) is obtained using a Kraus representation = ^f=i ApA and the 
maximally entangled state. Let and |0) be the maximally entangled states on C s (g> C s and 
C n (g> C n respectively. Then 

d 

(*®5)(h?>$|)= E(a®am^I(4*®4) ( A - x ) 



^(AA* ® /)|0><0|(AA* ® /) + ]T(A ® AMMK* ® 4) ( A - 2 ) 

i=l i^j 



where we used the identity (A^A)\f) = y/n/s {AA* ® I)\<f>) . Note that J2i AA = $(-^)> therefore 

d 

TrA A = Tr$(/ S ) = Tr/ S = s 



Also 



hence 



(0|(AA®W> = -TrA4*] 

n 



d s 2 

2 - s 



Hence 



£(AA*®')i^l(AA*®')l0> = 4E(Tr[AA*ir > , 2 
t=i i=i 



(0|($®$)(|^|)|0)>£ (A.3) 



This shows that one of the eigenvalues of (<3> ® (^l) is larger than or equal to p — s/(dn). 

The rank of this matrix is d 2 , hence it has at most d 2 — 1 other nonzero eigenvalues. Given that 
sd > n, the entropy is maximized when these other eigenvalues are equal to (1 — p)/{d 2 — 1). This 
implies that the entropy cannot be larger than 



S[($® < g(p) = -plogp- (1 -p)log 



1 — p 

d?-l 



B Proof of Lemma LT31 



The result is very similar to the proof of Lemma 11 in [Hj, but with important differences in detail. 
Let \ip) be a fixed state in V s , and let \9) be a random pure state in V s , with probability distribution 
a s . We write x = (ip\0), and let \4>) be the state orthogonal to \ip) such that 

\9)=x\lp) + y/l-\x\2\<l>) (B.l) 
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Thus |0) is also a random state, defined by its relation to the uniformly random state \9) in (IB.lj) . 
The following result was proved in [14] . 



Proposition 14 x and \<p) are independent. \<p) is a random vector in V s _i with distribution er s _i. 
For allO<t<l 

a s {\9) : \(rf,\9)\ = |a:|>t} = (l-t 2 r 1 (B.2) 

Proposition [T4l implies that as s —>■ oo the overlap x = (ip\0) becomes concentrated around zero. 
In other words, with high probability a randomly chosen state will be almost orthogonal to any 
given fixed state. As a consequence, from (IB. II) it follows that \(f>) will be almost equal to \8). This 
statement is made precise by noting that 

\\\e)-m\2<y/2\{m\ (B-3) 

Then ( IB. 21) immediately implies that 

*.(\0) ■■ \\\0)-\4>)h>t)<(l-^) (B.4) 

The second property relies on the form of the conjugate channel If the Kraus decomposition 
for $ is = Ylt=i AipA* then the Kraus decomposition for $ c is 

d 

^ c (p)=J2MA k pA^)\k)(l\ (B.5) 

k,l=l 

For any fixed channel $ and random state \6), with high probability the norm of the matrix 
<& c (\9) is small, and approaches zero as n — > oo. We will prove the following bound: for any 
$ e K(s, n, d), and for all < t < 1, 

1 - (£) ) (b - 6) 

As a first step toward deriving ( IB. 61) . note that for any vectors \u) and \v), 

\\<S> c (\u)(v\)\\ 2 = (j2\(v\A*A k \u)A <rfmax|( V |^ fc |n)|. (B.7) 



\k,l=l 



Since 52t=i A*Ai = I it follows that ||^4j||oo < 1 for all i = 1, . . . , cf, which implies that 

||$ c (| M )(t;|)|| 2 <rf||| M )|| 2 ||H|| 2 (B.8) 
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To derive (IB. 61) we apply (1B.7j) with u = 9 and v = ip and deduce that 



: ||$ c '(l^l)l| 2 >tJ < a s (j0> : max | ^\A*A k \6) \ > - 



< d 2 a s [\6) : MA*A k \9)\>~ 



< d 2 1 



(B.9) 



where the last equality follows from (1B.2|) . Note that for each k,l, the above \A* k Aiip) is a fixed 
vector with norm less than 1. 

With these ingredients in place the proof of Lemma [13] can proceed. By assumption $ is a 
channel belonging to the typical set T, and p = $ (IV'XV'I) is some state in Im($ c ). Let \9) be a 
random input state, then as in (IB. II) we write 



It follows that 



\9)=x\ip) + v 7 ! - M 



\x\ 2 IVXVI + (1 - |af ) |0) (0| + ^1-N 2 (a; + x\<p)(^\) 



(B.10) 



Write r = |x| 2 , then (IB. 101) yields 



<£ c (|^|) -( r ^(\iP)(iP\) + (l-r)-I 



d 



(1-r) 



+ e-*|0>(V>i)(B.ll) 



where £ is the phase of x. Since r < 1 this implies 



^(\6)(6\)-(r^(\^)^\) + (l-r)-I 



< 



® c mm-- d i 



® c {\m\) 



(B.12) 



Referring to the definition (12.71) of Tube(p), recall that & c (\9)(9\) belongs to Tube(p) if and only 
if for some r satisfying 7 < r < 1, 



$ c (\6)(6\)- (r$ G '(|^)(^|) + (l-r)3/ 



,0 



/logn , \ogd 

<2\l— — + 13d 1 



n 



(B.13) 



Define the following three events in V s : 



A x = 


{\9):r 


A 2 = 


[m ■■ 


A 3 = 


[m ■■ 



>7} 



< 



+ 





(B.14) 


logn | 


(B.15) 


n J 




(B.16) 
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It follows from flRl2|) and (|RT3|) that 

A x n A 2 n A 3 C {|0> : <K C '(|fl>(#|) G Tube(p)} 
Furthermore by Proposition [TH is independent of A 2 and A3, hence 



a s (<S> c (\9)(6\) e Tube(p)) > ^(^ n A 2 n A 3 ) = a s {A x ) a s {A 2 n A 3 



Proposition [14] immediately yields 



a s (A 1 ) = (l- 7 r 1 



From (lB.18j) this gives 

a s £>(£|) e Tube(p)) > (1 - 7) 5 - 1 (1 - <r.{A® - *.(J®) 
In order to bound a s {A^) we first use (IB. 8|) to deduce 

ll$ C (l^)(0l)Hoo< ll^(l^)(0l)l|2<||$ C (|^)(^|)|| 2 + rf|||^)- 

Thus 



<r.(^) = <xA\6) 



I Qd 2 log d + 1 12d 2 log d 



< <x s ||0> : ||$ C (|^)^|)|| 2 + rf|||^)-|0)|| 2 > 

< °s{\0) : 



6d 2 log d /12d 2 logd 



2 > 



1 



< (d 2 + l) 1 



61ogd\ s - 1 



where the last inequality follows from (IB.9I) and (IB.4|) . 
Turning now to a s {A^), note first that 



* a {\m\)--j 



< 



< 



$ c {\<t>){<t>\)-$ c {\e)(e\) 

® c {\m\)-® c {\6)(6\) 



$ c {\e)(e\)-\i 
$ c {\e)(o\)-\i 



< 2dp)-|0lla + 



^(\e)(9\)--i 
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where we used ( 1B.8j) for the last inequality. As in (1B.22j) this gives 
a s {A c 2 ) 



$ C (I0)(0D- -J 



d 



48c/ 2 log 2 /log w 



+<x s j|0> : 
6 log d x 1 



2 > 



12 log d 



> 2 



logn 



n 



< 1- 



vs{ \9) : 



$C(\e)(0\)--I 



> 2 



logn 



n 



(B.25) 



where we used (1B.4|) for the last inequality. By assumption <3> G T, and therefore there is a set of 
input states L with cr s (L) > 1/2 such that 



|0)6L 



$°(|e>(fl|)-ij 



< 2 



logn 



n 



(B.26) 



Thus 



^{\e){e\)-- d i 



>2,W<cr.(L«)<i 



(B.27) 



Putting together the bounds flB~20|) . flB~22l . flB~25|) and flB~27j) we get 



<r.(* c (|0>(0|)eThbe(p)) > (l- 7 )-^l-a s (A c 2 )-a s (A c 3 

s-1 

/ / I I I / ! ( V / / \ 

\S-1 



> (1-7)' 



1-1 



Q\ogd 



1 { ^ + 1) ( 1 -^ d v" 

2 V s 



= (i- 7 r 1 Q-(^+2)(i 

The proof now follows by noting that for all d, s > 2 

(d 2 + 2) (l 



6 loe;d\ ^ 



(B.28) 



Glogdy- 1 1 
s ) ~ 4 



and hence 



1 



( T s ($ c '(|^)^|)GTube(p))>^(l- 7 r 1 



(B.29) 



(B.30) 
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C Proof of Lemma [8 



We use properties of the function derived in Section 5.7 of [Hj. We have 



d — z 

m<t{y) = g(z), g(z) = - log z - (d - 1) log — - (C.l) 



where z = h 1 (y) and 



h(z) =z log z + (d-z) log^j— j (C.2) 



As was shown in [13] both functions g, h are increasing, and h(l) = 0. Since the functions are 
analytic, their behavior near z = 1 is determined by their power series expansions at z = 1. To 
leading order these are 

"< 1 + i > = (^T)) t2 -(^) t3 + °( t4 ) < a3 > 

Setting y = /i(l + £), solving the second series for t in terms of y, and substituting into the first 
series gives 

/ d 2 -2d \ /2(rf-l)\ 3/2 3/2 
™d(y) =y-[ o/ J 7v? J z/ 3/ + ■ ■ • (C4) 



3(d-l)V V d 

Thus for sufficiently small y we have m ( i(y)/y ~ 1 — k^/y, which implies both statements in Lemma 

E 
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